Nested Kriging predictions for datasets with a large number of observations
نویسندگان
چکیده
This work falls within the context of predicting the value of a real function at some input locations given a limited number of observations of this function. The Kriging interpolation technique (or Gaussian process regression) is often considered to tackle such a problem but the method suffers from its computational burden when the number of observation points is large. We introduce in this article nested Kriging predictors which are constructed by aggregating sub-models based on subsets of observation points. This approach is proven to have better theoretical properties than other aggregation methods that can be found in the literature. Contrarily to some other methods it can be shown that the proposed aggregation method is consistent. Finally, the practical interest of the proposed method is illustrated on simulated datasets and on an industrial test case with 104 observations in a 6-dimensional space.
منابع مشابه
Kriging with External Drift in Model Localization
When modelling a large area, models that can take into a count the variation from the general mean in small sub-areas could perform better in prediction than a general model fitted to entire dataset. One method for adjusting the large-area models for such variation is kriging, in which the predictions are corrected with the aid of neighbouring observations. A variogram represents the spatial co...
متن کاملNORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET Estimation and prediction in spatial models with block composite likelihoods using parallel computing
A block composite likelihood model is developed for estimation and prediction in large spatial datasets. The composite likelihood is constructed from the joint densities of pairs of adjacent spatial blocks. This allows large datasets to be split into many smaller datasets, each of which can be evaluated separately, and combined through a simple summation. Estimates for unknown parameters as wel...
متن کاملTutorial on Fixed Rank Kriging (FRK) of CO2 Data
In this document, we describe Fixed Rank Kriging (FRK), an approach to the analysis of very large spatial datasets. Such datasets now arise in many fields; our focus is on satellite measurements of CO2. FRK predictors and standard errors can be computed rapidly, even for datasets with a million or more observations. FRK relies on a so-called spatial random effects (SRE) model, which assumes tha...
متن کاملComparison of Geographically Weighted Regression and Regression Kriging to Estimate the Spatial Distribution of Aboveground Biomass of Zagros Forests
Aboveground biomass (AGB) of forests is an essential component of the global carbon cycle. Mapping above-ground biomass is important for estimating CO2 emissions, and planning and monitoring of forests and ecosystem productivity. Remote sensing provides wide observations to monitor forest coverage, the Landsat 8 mission provides valuable opportunities for quantifying the distribution of above-g...
متن کاملImproved Univariate Microaggregation for Integer Values
Privacy issues during data publishing is an increasing concern of involved entities. The problem is addressed in the field of statistical disclosure control with the aim of producing protected datasets that are also useful for interested end users such as government agencies and research communities. The problem of producing useful protected datasets is addressed in multiple computational priva...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Statistics and Computing
دوره 28 شماره
صفحات -
تاریخ انتشار 2018